Processing math: 100%

From Anchor to ROI

layer area

From layer i to layer i+1, assume the parameters on layer i are si (stride), pi (patch), ki (kernel filter size), the width or height of layer i are ri. Then, based on common sense,

ri+1=(ri+2piki)/si+1.

In the reverse process, ri=siri+1si2pi+ki or ri=siri+1si+ki if counting in padding area.

coordinate map

Now consider mapping the point xi on the ROI to the point xi+1 on the feature map, which can be transformed to the layer area problem above. In particular, the receptive field formed by left-up corner and xi on the ROI can be mapped to the region formed by left-up corner and xi+1 on the feature map. Based on the similar formula for the layer area problem above (note the only difference is that we only include left padding and up padding, and subtract the radius of kernel filter (ki1)/2,

xi=sixi+1sipi+ki(ki1)/2.

The above coordinate system starts from 1. When the coordinate system starts from 0,

xi+1=si(xi+1+1)sipi+ki(ki1)/2,

which can be simplified as

xi=sixi+1+(ki12pi).

when pi=floor(ki/2), xi=sixi+1 approximately, which is the simplest case.

By applying xi=sixi+1+(ki12pi) recursively, we can achieve a general solution

x1=αLxL+βL,

in which αL=L1l=1sl and βL=L1l=1(l1n=1sn)(kl12pl)

anchor box to ROI

Given two corner points of an anchor box on the feature map, we can find their corresponding points on the original image, which determine the ROI.